Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 100000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 12.2 MiB |
| Average record size in memory | 128.0 B |
Variable types
| NUM | 12 |
|---|---|
| CAT | 4 |
AvgClaimAmount is highly correlated with ClaimAmount | High correlation |
ClaimAmount is highly correlated with AvgClaimAmount | High correlation |
ClaimAmount is highly skewed (γ1 = 50.42130238) | Skewed |
PurePremium is highly skewed (γ1 = 261.537418) | Skewed |
Frequency is highly skewed (γ1 = 51.04192076) | Skewed |
AvgClaimAmount is highly skewed (γ1 = 51.65522984) | Skewed |
IDpol has unique values | Unique |
ClaimNb has 95262 (95.3%) zeros | Zeros |
VehAge has 11656 (11.7%) zeros | Zeros |
ClaimAmount has 95262 (95.3%) zeros | Zeros |
PurePremium has 95262 (95.3%) zeros | Zeros |
Frequency has 95262 (95.3%) zeros | Zeros |
AvgClaimAmount has 95262 (95.3%) zeros | Zeros |
Reproduction
| Analysis started | 2020-09-24 19:40:04.840049 |
|---|---|
| Analysis finished | 2020-09-24 19:40:30.733104 |
| Duration | 25.89 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 100000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 210627.3501 |
|---|---|
| Minimum | 1 |
| Maximum | 1019556 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 11352.8 |
| Q1 | 53369.75 |
| median | 101265 |
| Q3 | 157101.25 |
| 95-th percentile | 1013021.05 |
| Maximum | 1019556 |
| Range | 1019555 |
| Interquartile range (IQR) | 103731.5 |
Descriptive statistics
| Standard deviation | 312518.4398 |
|---|---|
| Coefficient of variation (CV) | 1.483750518 |
| Kurtosis | 2.60753771 |
| Mean | 210627.3501 |
| Median Absolute Deviation (MAD) | 51909 |
| Skewness | 2.09969199 |
| Sum | 2.106273501e+10 |
| Variance | 9.766777521e+10 |
| Monotocity | Strictly increasing |
| Value | Count | Frequency (%) | |
| 4094 | 1 | < 0.1% | |
| 156313 | 1 | < 0.1% | |
| 177895 | 1 | < 0.1% | |
| 1000061 | 1 | < 0.1% | |
| 1006206 | 1 | < 0.1% | |
| 1004159 | 1 | < 0.1% | |
| 174720 | 1 | < 0.1% | |
| 47746 | 1 | < 0.1% | |
| 45699 | 1 | < 0.1% | |
| 140041 | 1 | < 0.1% | |
| Other values (99990) | 99990 | > 99.9% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| 11 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1019556 | 1 | < 0.1% | |
| 1019554 | 1 | < 0.1% | |
| 1019552 | 1 | < 0.1% | |
| 1019550 | 1 | < 0.1% | |
| 1019549 | 1 | < 0.1% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.04945 |
|---|---|
| Minimum | 0 |
| Maximum | 4 |
| Zeros | 95262 |
| Zeros (%) | 95.3% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 4 |
| Range | 4 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2268153697 |
|---|---|
| Coefficient of variation (CV) | 4.586761774 |
| Kurtosis | 25.44775027 |
| Mean | 0.04945 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.802556371 |
| Sum | 4945 |
| Variance | 0.05144521195 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 95262 | 95.3% | |
| 1 | 4544 | 4.5% | |
| 2 | 183 | 0.2% | |
| 3 | 9 | < 0.1% | |
| 4 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 95262 | 95.3% | |
| 1 | 4544 | 4.5% | |
| 2 | 183 | 0.2% | |
| 3 | 9 | < 0.1% | |
| 4 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4 | 2 | < 0.1% | |
| 3 | 9 | < 0.1% | |
| 2 | 183 | 0.2% | |
| 1 | 4544 | 4.5% | |
| 0 | 95262 | 95.3% |
Exposure
Real number (ℝ≥0)
| Distinct | 96 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5508033494 |
|---|---|
| Minimum | 0.002732240437 |
| Maximum | 0.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0.002732240437 |
|---|---|
| 5-th percentile | 0.04 |
| Q1 | 0.22 |
| median | 0.59 |
| Q3 | 0.9 |
| 95-th percentile | 0.9 |
| Maximum | 0.9 |
| Range | 0.8972677596 |
| Interquartile range (IQR) | 0.68 |
Descriptive statistics
| Standard deviation | 0.3314737002 |
|---|---|
| Coefficient of variation (CV) | 0.6018004439 |
| Kurtosis | -1.520867805 |
| Mean | 0.5508033494 |
| Median Absolute Deviation (MAD) | 0.31 |
| Skewness | -0.2742961367 |
| Sum | 55080.33494 |
| Variance | 0.1098748139 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.9 | 35077 | 35.1% | |
| 0.03 | 1723 | 1.7% | |
| 0.04 | 1603 | 1.6% | |
| 0.12 | 1515 | 1.5% | |
| 0.24 | 1412 | 1.4% | |
| 0.08 | 1403 | 1.4% | |
| 0.07 | 1387 | 1.4% | |
| 0.06 | 1343 | 1.3% | |
| 0.16 | 1254 | 1.3% | |
| 0.09 | 1238 | 1.2% | |
| Other values (86) | 52045 | 52.0% |
| Value | Count | Frequency (%) | |
| 0.002732240437 | 474 | 0.5% | |
| 0.002739726027 | 80 | 0.1% | |
| 0.005464480874 | 232 | 0.2% | |
| 0.005479452055 | 38 | < 0.1% | |
| 0.008196721311 | 309 | 0.3% |
| Value | Count | Frequency (%) | |
| 0.9 | 35077 | 35.1% | |
| 0.89 | 399 | 0.4% | |
| 0.88 | 237 | 0.2% | |
| 0.87 | 832 | 0.8% | |
| 0.86 | 312 | 0.3% |
Area
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.2 KiB |
| C | |
|---|---|
| D | |
| A | |
| E | |
| B |
| Value | Count | Frequency (%) | |
| C | 29128 | 29.1% | |
| D | 21750 | 21.8% | |
| A | 17513 | 17.5% | |
| E | 16995 | 17.0% | |
| B | 12231 | 12.2% | |
| F | 2383 | 2.4% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
VehPower
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.3213 |
|---|---|
| Minimum | 4 |
| Maximum | 15 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 5 |
| median | 6 |
| Q3 | 7 |
| 95-th percentile | 10 |
| Maximum | 15 |
| Range | 11 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.908114965 |
|---|---|
| Coefficient of variation (CV) | 0.3018548344 |
| Kurtosis | 1.894160016 |
| Mean | 6.3213 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.190522156 |
| Sum | 632130 |
| Variance | 3.640902719 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 6 | 24087 | 24.1% | |
| 7 | 21697 | 21.7% | |
| 5 | 19838 | 19.8% | |
| 4 | 16600 | 16.6% | |
| 8 | 4883 | 4.9% | |
| 9 | 4750 | 4.8% | |
| 10 | 4537 | 4.5% | |
| 11 | 2018 | 2.0% | |
| 12 | 701 | 0.7% | |
| 13 | 368 | 0.4% | |
| Other values (2) | 521 | 0.5% |
| Value | Count | Frequency (%) | |
| 4 | 16600 | 16.6% | |
| 5 | 19838 | 19.8% | |
| 6 | 24087 | 24.1% | |
| 7 | 21697 | 21.7% | |
| 8 | 4883 | 4.9% |
| Value | Count | Frequency (%) | |
| 15 | 267 | 0.3% | |
| 14 | 254 | 0.3% | |
| 13 | 368 | 0.4% | |
| 12 | 701 | 0.7% | |
| 11 | 2018 | 2.0% |
| Distinct | 51 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.9709 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros | 11656 |
| Zeros (%) | 11.7% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 6 |
| Q3 | 11 |
| 95-th percentile | 16 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 5.670421036 |
|---|---|
| Coefficient of variation (CV) | 0.8134417415 |
| Kurtosis | 18.68070925 |
| Mean | 6.9709 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.770070138 |
| Sum | 697090 |
| Variance | 32.15367473 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 11656 | 11.7% | |
| 1 | 10257 | 10.3% | |
| 2 | 6262 | 6.3% | |
| 10 | 6145 | 6.1% | |
| 4 | 6111 | 6.1% | |
| 5 | 5847 | 5.8% | |
| 6 | 5652 | 5.7% | |
| 8 | 5620 | 5.6% | |
| 9 | 5486 | 5.5% | |
| 11 | 5470 | 5.5% | |
| Other values (41) | 31494 | 31.5% |
| Value | Count | Frequency (%) | |
| 0 | 11656 | 11.7% | |
| 1 | 10257 | 10.3% | |
| 2 | 6262 | 6.3% | |
| 3 | 5146 | 5.1% | |
| 4 | 6111 | 6.1% |
| Value | Count | Frequency (%) | |
| 100 | 3 | < 0.1% | |
| 99 | 23 | < 0.1% | |
| 78 | 1 | < 0.1% | |
| 59 | 1 | < 0.1% | |
| 48 | 4 | < 0.1% |
DrivAge
Real number (ℝ≥0)
| Distinct | 80 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.61541 |
|---|---|
| Minimum | 18 |
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 24 |
| Q1 | 33 |
| median | 43 |
| Q3 | 54 |
| 95-th percentile | 72 |
| Maximum | 99 |
| Range | 81 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 14.4932357 |
|---|---|
| Coefficient of variation (CV) | 0.3248482016 |
| Kurtosis | -0.3004203931 |
| Mean | 44.61541 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.488615178 |
| Sum | 4461541 |
| Variance | 210.0538811 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 50 | 2673 | 2.7% | |
| 34 | 2629 | 2.6% | |
| 30 | 2611 | 2.6% | |
| 51 | 2569 | 2.6% | |
| 48 | 2564 | 2.6% | |
| 35 | 2530 | 2.5% | |
| 36 | 2523 | 2.5% | |
| 49 | 2509 | 2.5% | |
| 45 | 2485 | 2.5% | |
| 33 | 2447 | 2.4% | |
| Other values (70) | 74460 | 74.5% |
| Value | Count | Frequency (%) | |
| 18 | 157 | 0.2% | |
| 19 | 415 | 0.4% | |
| 20 | 694 | 0.7% | |
| 21 | 870 | 0.9% | |
| 22 | 960 | 1.0% |
| Value | Count | Frequency (%) | |
| 99 | 66 | 0.1% | |
| 96 | 2 | < 0.1% | |
| 95 | 6 | < 0.1% | |
| 94 | 3 | < 0.1% | |
| 93 | 8 | < 0.1% |
BonusMalus
Real number (ℝ≥0)
| Distinct | 96 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 60.19731 |
|---|---|
| Minimum | 50 |
| Maximum | 230 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 50 |
|---|---|
| 5-th percentile | 50 |
| Q1 | 50 |
| median | 50 |
| Q3 | 67 |
| 95-th percentile | 95 |
| Maximum | 230 |
| Range | 180 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 16.26036416 |
|---|---|
| Coefficient of variation (CV) | 0.2701177871 |
| Kurtosis | 3.277284078 |
| Mean | 60.19731 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.783297026 |
| Sum | 6019731 |
| Variance | 264.3994428 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 50 | 56083 | 56.1% | |
| 100 | 2879 | 2.9% | |
| 72 | 2741 | 2.7% | |
| 76 | 2702 | 2.7% | |
| 68 | 2694 | 2.7% | |
| 60 | 2688 | 2.7% | |
| 57 | 2670 | 2.7% | |
| 80 | 2665 | 2.7% | |
| 90 | 2617 | 2.6% | |
| 64 | 2587 | 2.6% | |
| Other values (86) | 19674 | 19.7% |
| Value | Count | Frequency (%) | |
| 50 | 56083 | 56.1% | |
| 51 | 2231 | 2.2% | |
| 52 | 754 | 0.8% | |
| 53 | 457 | 0.5% | |
| 54 | 2519 | 2.5% |
| Value | Count | Frequency (%) | |
| 230 | 1 | < 0.1% | |
| 228 | 1 | < 0.1% | |
| 198 | 1 | < 0.1% | |
| 196 | 1 | < 0.1% | |
| 195 | 4 | < 0.1% |
VehBrand
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.2 KiB |
| B1 | |
|---|---|
| B2 | |
| B12 | |
| B3 | |
| B5 | |
| Other values (6) |
| Value | Count | Frequency (%) | |
| B1 | 27240 | 27.2% | |
| B2 | 26500 | 26.5% | |
| B12 | 16619 | 16.6% | |
| B3 | 8260 | 8.3% | |
| B5 | 6053 | 6.1% | |
| B6 | 4714 | 4.7% | |
| B4 | 3968 | 4.0% | |
| B10 | 2268 | 2.3% | |
| B13 | 1883 | 1.9% | |
| B11 | 1774 | 1.8% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.23265 |
| Min length | 2 |
VehGas
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.2 KiB |
| Regular | |
|---|---|
| Diesel |
| Value | Count | Frequency (%) | |
| Regular | 57255 | 57.3% | |
| Diesel | 42745 | 42.7% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.57255 |
| Min length | 6 |
Density
Real number (ℝ≥0)
| Distinct | 1490 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1605.39738 |
|---|---|
| Minimum | 2 |
| Maximum | 27000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 78 |
| median | 298 |
| Q3 | 1326 |
| 95-th percentile | 6595 |
| Maximum | 27000 |
| Range | 26998 |
| Interquartile range (IQR) | 1248 |
Descriptive statistics
| Standard deviation | 3848.224014 |
|---|---|
| Coefficient of variation (CV) | 2.397053877 |
| Kurtosis | 27.77036445 |
| Mean | 1605.39738 |
| Median Absolute Deviation (MAD) | 268 |
| Skewness | 4.950557239 |
| Sum | 160539738 |
| Variance | 14808828.07 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 27000 | 1490 | 1.5% | |
| 1313 | 1421 | 1.4% | |
| 405 | 1199 | 1.2% | |
| 9307 | 901 | 0.9% | |
| 4128 | 840 | 0.8% | |
| 57 | 766 | 0.8% | |
| 91 | 741 | 0.7% | |
| 473 | 720 | 0.7% | |
| 3744 | 716 | 0.7% | |
| 3317 | 711 | 0.7% | |
| Other values (1480) | 90495 | 90.5% |
| Value | Count | Frequency (%) | |
| 2 | 13 | < 0.1% | |
| 3 | 39 | < 0.1% | |
| 4 | 37 | < 0.1% | |
| 5 | 67 | 0.1% | |
| 6 | 103 | 0.1% |
| Value | Count | Frequency (%) | |
| 27000 | 1490 | 1.5% | |
| 23396 | 6 | < 0.1% | |
| 22821 | 25 | < 0.1% | |
| 22669 | 62 | 0.1% | |
| 21410 | 9 | < 0.1% |
Region
Categorical
| Distinct | 22 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 781.2 KiB |
| R24 | |
|---|---|
| R82 | |
| R53 | |
| R11 | |
| R93 | |
| Other values (17) |
| Value | Count | Frequency (%) | |
| R24 | 30482 | 30.5% | |
| R82 | 13628 | 13.6% | |
| R53 | 8556 | 8.6% | |
| R11 | 8270 | 8.3% | |
| R93 | 7516 | 7.5% | |
| R52 | 7238 | 7.2% | |
| R54 | 3703 | 3.7% | |
| R72 | 3374 | 3.4% | |
| R91 | 2918 | 2.9% | |
| R31 | 2727 | 2.7% | |
| Other values (12) | 11588 | 11.6% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
| Distinct | 2581 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 92.5399367 |
|---|---|
| Minimum | 0 |
| Maximum | 100000 |
| Zeros | 95262 |
| Zeros (%) | 95.3% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 100000 |
| Range | 100000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1407.07132 |
|---|---|
| Coefficient of variation (CV) | 15.2050171 |
| Kurtosis | 3079.622955 |
| Mean | 92.5399367 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 50.42130238 |
| Sum | 9253993.67 |
| Variance | 1979849.701 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 95262 | 95.3% | |
| 1128.12 | 1807 | 1.8% | |
| 1128 | 105 | 0.1% | |
| 564.06 | 59 | 0.1% | |
| 2256.24 | 44 | < 0.1% | |
| 75.32 | 18 | < 0.1% | |
| 564 | 13 | < 0.1% | |
| 2652.61 | 9 | < 0.1% | |
| 2256.25 | 8 | < 0.1% | |
| 100000 | 8 | < 0.1% | |
| Other values (2571) | 2667 | 2.7% |
| Value | Count | Frequency (%) | |
| 0 | 95262 | 95.3% | |
| 1.49 | 1 | < 0.1% | |
| 4.09 | 1 | < 0.1% | |
| 4.6 | 1 | < 0.1% | |
| 5.08 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 100000 | 8 | < 0.1% | |
| 96422.32 | 1 | < 0.1% | |
| 85442.21 | 1 | < 0.1% | |
| 74966.23 | 1 | < 0.1% | |
| 74784.9 | 1 | < 0.1% |
| Distinct | 2854 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 456.3469753 |
|---|---|
| Minimum | 0 |
| Maximum | 10000000 |
| Zeros | 95262 |
| Zeros (%) | 95.3% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 10000000 |
| Range | 10000000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 33899.43191 |
|---|---|
| Coefficient of variation (CV) | 74.28433571 |
| Kurtosis | 75931.25865 |
| Mean | 456.3469753 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 261.537418 |
| Sum | 45634697.53 |
| Variance | 1149171484 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 95262 | 95.3% | |
| 1253.466667 | 912 | 0.9% | |
| 2506.933333 | 45 | < 0.1% | |
| 626.7333333 | 31 | < 0.1% | |
| 2256.24 | 29 | < 0.1% | |
| 1504.16 | 24 | < 0.1% | |
| 1253.333333 | 22 | < 0.1% | |
| 1410.15 | 21 | < 0.1% | |
| 1709.272727 | 20 | < 0.1% | |
| 1359.180723 | 18 | < 0.1% | |
| Other values (2844) | 3616 | 3.6% |
| Value | Count | Frequency (%) | |
| 0 | 95262 | 95.3% | |
| 1.655555556 | 1 | < 0.1% | |
| 4.544444444 | 1 | < 0.1% | |
| 5.111111111 | 1 | < 0.1% | |
| 5.644444444 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 10000000 | 1 | < 0.1% | |
| 1825067.5 | 1 | < 0.1% | |
| 1666666.667 | 1 | < 0.1% | |
| 1564061.96 | 1 | < 0.1% | |
| 1196521.5 | 1 | < 0.1% |
| Distinct | 126 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1475403858 |
|---|---|
| Minimum | 0 |
| Maximum | 183 |
| Zeros | 95262 |
| Zeros (%) | 95.3% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 183 |
| Range | 183 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.039116697 |
|---|---|
| Coefficient of variation (CV) | 13.82073584 |
| Kurtosis | 3404.476253 |
| Mean | 0.1475403858 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 51.04192076 |
| Sum | 14754.03858 |
| Variance | 4.157996905 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 95262 | 95.3% | |
| 1.111111111 | 2023 | 2.0% | |
| 2.222222222 | 118 | 0.1% | |
| 2 | 70 | 0.1% | |
| 1.333333333 | 67 | 0.1% | |
| 3.03030303 | 48 | < 0.1% | |
| 1.204819277 | 47 | < 0.1% | |
| 1.149425287 | 47 | < 0.1% | |
| 1.351351351 | 47 | < 0.1% | |
| 1.515151515 | 46 | < 0.1% | |
| Other values (116) | 2225 | 2.2% |
| Value | Count | Frequency (%) | |
| 0 | 95262 | 95.3% | |
| 1.111111111 | 2023 | 2.0% | |
| 1.123595506 | 26 | < 0.1% | |
| 1.136363636 | 13 | < 0.1% | |
| 1.149425287 | 47 | < 0.1% |
| Value | Count | Frequency (%) | |
| 183 | 3 | < 0.1% | |
| 122 | 6 | < 0.1% | |
| 100 | 11 | < 0.1% | |
| 50 | 14 | < 0.1% | |
| 33.33333333 | 18 | < 0.1% |
| Distinct | 2586 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 87.83762728 |
|---|---|
| Minimum | 0 |
| Maximum | 100000 |
| Zeros | 95262 |
| Zeros (%) | 95.3% |
| Memory size | 781.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 100000 |
| Range | 100000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1324.212353 |
|---|---|
| Coefficient of variation (CV) | 15.07568446 |
| Kurtosis | 3273.256454 |
| Mean | 87.83762728 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 51.65522984 |
| Sum | 8783762.728 |
| Variance | 1753538.356 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 95262 | 95.3% | |
| 1128.12 | 1847 | 1.8% | |
| 1128 | 105 | 0.1% | |
| 564.06 | 59 | 0.1% | |
| 75.32 | 18 | < 0.1% | |
| 564 | 13 | < 0.1% | |
| 2652.61 | 9 | < 0.1% | |
| 2256.25 | 7 | < 0.1% | |
| 69.65 | 6 | < 0.1% | |
| 100000 | 6 | < 0.1% | |
| Other values (2576) | 2668 | 2.7% |
| Value | Count | Frequency (%) | |
| 0 | 95262 | 95.3% | |
| 1.49 | 1 | < 0.1% | |
| 4.09 | 1 | < 0.1% | |
| 4.6 | 1 | < 0.1% | |
| 5.08 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 100000 | 6 | < 0.1% | |
| 96422.32 | 1 | < 0.1% | |
| 85442.21 | 1 | < 0.1% | |
| 74966.23 | 1 | < 0.1% | |
| 74784.9 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| IDpol | ClaimNb | Exposure | Area | VehPower | VehAge | DrivAge | BonusMalus | VehBrand | VehGas | Density | Region | ClaimAmount | PurePremium | Frequency | AvgClaimAmount | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 0 | 0.10 | D | 5 | 0 | 55 | 50 | B12 | Regular | 1217 | R82 | 0.0 | 0.0 | 0.0 | 0.0 |
| 1 | 3 | 0 | 0.77 | D | 5 | 0 | 55 | 50 | B12 | Regular | 1217 | R82 | 0.0 | 0.0 | 0.0 | 0.0 |
| 2 | 5 | 0 | 0.75 | B | 6 | 2 | 52 | 50 | B12 | Diesel | 54 | R22 | 0.0 | 0.0 | 0.0 | 0.0 |
| 3 | 10 | 0 | 0.09 | B | 7 | 0 | 46 | 50 | B12 | Diesel | 76 | R72 | 0.0 | 0.0 | 0.0 | 0.0 |
| 4 | 11 | 0 | 0.84 | B | 7 | 0 | 46 | 50 | B12 | Diesel | 76 | R72 | 0.0 | 0.0 | 0.0 | 0.0 |
| 5 | 13 | 0 | 0.52 | E | 6 | 2 | 38 | 50 | B12 | Regular | 3003 | R31 | 0.0 | 0.0 | 0.0 | 0.0 |
| 6 | 15 | 0 | 0.45 | E | 6 | 2 | 38 | 50 | B12 | Regular | 3003 | R31 | 0.0 | 0.0 | 0.0 | 0.0 |
| 7 | 17 | 0 | 0.27 | C | 7 | 0 | 33 | 68 | B12 | Diesel | 137 | R91 | 0.0 | 0.0 | 0.0 | 0.0 |
| 8 | 18 | 0 | 0.71 | C | 7 | 0 | 33 | 68 | B12 | Diesel | 137 | R91 | 0.0 | 0.0 | 0.0 | 0.0 |
| 9 | 21 | 0 | 0.15 | B | 7 | 0 | 41 | 50 | B12 | Diesel | 60 | R52 | 0.0 | 0.0 | 0.0 | 0.0 |
Last rows
| IDpol | ClaimNb | Exposure | Area | VehPower | VehAge | DrivAge | BonusMalus | VehBrand | VehGas | Density | Region | ClaimAmount | PurePremium | Frequency | AvgClaimAmount | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 99990 | 1019536 | 0 | 0.90 | B | 9 | 7 | 54 | 50 | B1 | Regular | 57 | R24 | 0.0 | 0.0 | 0.0 | 0.0 |
| 99991 | 1019539 | 0 | 0.90 | A | 7 | 12 | 46 | 50 | B2 | Regular | 26 | R24 | 0.0 | 0.0 | 0.0 | 0.0 |
| 99992 | 1019541 | 0 | 0.90 | E | 6 | 6 | 64 | 50 | B2 | Diesel | 3233 | R82 | 0.0 | 0.0 | 0.0 | 0.0 |
| 99993 | 1019543 | 0 | 0.90 | D | 7 | 3 | 48 | 50 | B3 | Regular | 593 | R82 | 0.0 | 0.0 | 0.0 | 0.0 |
| 99994 | 1019545 | 0 | 0.90 | B | 7 | 5 | 51 | 50 | B5 | Regular | 51 | R24 | 0.0 | 0.0 | 0.0 | 0.0 |
| 99995 | 1019549 | 0 | 0.49 | C | 7 | 14 | 74 | 50 | B1 | Regular | 182 | R24 | 0.0 | 0.0 | 0.0 | 0.0 |
| 99996 | 1019550 | 0 | 0.50 | C | 7 | 14 | 74 | 50 | B1 | Regular | 182 | R24 | 0.0 | 0.0 | 0.0 | 0.0 |
| 99997 | 1019552 | 0 | 0.90 | C | 9 | 25 | 41 | 50 | B2 | Regular | 182 | R24 | 0.0 | 0.0 | 0.0 | 0.0 |
| 99998 | 1019554 | 0 | 0.90 | C | 7 | 9 | 44 | 50 | B1 | Regular | 191 | R24 | 0.0 | 0.0 | 0.0 | 0.0 |
| 99999 | 1019556 | 0 | 0.90 | E | 4 | 12 | 53 | 50 | B1 | Regular | 4116 | R24 | 0.0 | 0.0 | 0.0 | 0.0 |